Goto

Collaborating Authors

 reference picture


PhotoBot: Reference-Guided Interactive Photography via Natural Language

Limoyo, Oliver, Li, Jimmy, Rivkin, Dmitriy, Kelly, Jonathan, Dudek, Gregory

arXiv.org Artificial Intelligence

We introduce PhotoBot, a framework for automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer. We propose to communicate photography suggestions to the user via a reference picture that is retrieved from a curated gallery. We exploit a visual language model (VLM) and an object detector to characterize reference pictures via textual descriptions and use a large language model (LLM) to retrieve relevant reference pictures based on a user's language query through text-based reasoning. To correspond the reference picture and the observed scene, we exploit pre-trained features from a vision transformer capable of capturing semantic similarity across significantly varying images. Using these features, we compute pose adjustments for an RGB-D camera by solving a Perspective-n-Point (PnP) problem. We demonstrate our approach on a real-world manipulator equipped with a wrist camera. Our user studies show that photos taken by PhotoBot are often more aesthetically pleasing than those taken by users themselves, as measured by human feedback.


Pik-Fix: Restoring and Colorizing Old Photos

Xu, Runsheng, Tu, Zhengzhong, Du, Yuanqi, Dong, Xiaoyu, Li, Jinlong, Meng, Zibo, Ma, Jiaqi, Bovik, Alan, Yu, Hongkai

arXiv.org Artificial Intelligence

Restoring and inpainting the visual memories that are present, but often impaired, in old photos remains an intriguing but unsolved research topic. Decades-old photos often suffer from severe and commingled degradation such as cracks, defocus, and color-fading, which are difficult to treat individually and harder to repair when they interact. Deep learning presents a plausible avenue, but the lack of large-scale datasets of old photos makes addressing this restoration task very challenging. Here we present a novel reference-based end-to-end learning framework that is able to both repair and colorize old, degraded pictures. Our proposed framework consists of three modules: a restoration sub-network that conducts restoration from degradations, a similarity network that performs color histogram matching and color transfer, and a colorization subnet that learns to predict the chroma elements of images conditioned on chromatic reference signals. The overall system makes uses of color histogram priors from reference images, which greatly reduces the need for large-scale training data. We have also created a first-of-a-kind public dataset of real old photos that are paired with ground truth ''pristine'' photos that have been manually restored by PhotoShop experts. We conducted extensive experiments on this dataset and synthetic datasets, and found that our method significantly outperforms previous state-of-the-art models using both qualitative comparisons and quantitative measurements. The code is available at https://github.com/DerrickXuNu/Pik-Fix.


HP Labs teams robotics with machine learning

#artificialintelligence

"Machine learning and robotics are a perfect match," suggests HP Fellow Will Allen. Although experts in the one field rarely stray into the other, Allen says, their potential synergies are real. "Machine learning is very applicable to robotics, and robotics--by which I mean working with physical robots--needs some of the things that machine learning is good at," he argues. Now Allen, who has a background as a distinguished innovator in imaging and printing technologies, is co-leading a research team with colleague David Murphy in HP's Emerging Compute Lab that aims to understand, and potentially harness, those synergies to create a new generation of what the team are calling "Smart Machines." One of the main challenges in robotics--where you want electro-mechanical machines to perform specific tasks with some degree of autonomy--is to have the machines move both precisely and efficiently in 3D space.